首页> 外文OA文献 >A decomposition approach for undiscounted two-person zero-sum stochastic games
【2h】

A decomposition approach for undiscounted two-person zero-sum stochastic games

机译:无折扣两人零和随机游戏的分解方法

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Two-person zero-sum stochastic games are considered under the long-run average expected payoff criterion. State and action spaces are assumed finite. By making use of the concept of maximal communicating classes, the following decomposition algorithm is introduced for solving two-person zero-sum stochastic games: First, the state space is decomposed into maximal communicating classes. Then, these classes are organized in an hierarchical order where each level may contain more than one maximal communicating class. Best stationary strategies for the states in a maximal communicating class at a level are determined by using the best stationary strategies of the states in the previous levels that are accessible from that class. At the initial level, a restricted game is defined for each closed maximal communicating class and these restricted games are solved independently. It is shown that the proposed decomposition algorithm is exact in the sense that the solution obtained from the decomposition procedure gives the best stationary strategies for the original stochastic game.
机译:在长期平均预期收益标准下,考虑了两人零和随机游戏。状态和动作空间被假定为有限的。利用最大通信类的概念,引入了以下分解算法来求解两人零和随机游戏:首先,将状态空间分解为最大通信类。然后,这些类按层次结构顺序组织,其中每个级别可以包含一个以上的最大通信类。通过使用该级别可访问的先前级别中状态的最佳静态策略,可以确定某个级别的最大通信类中状态的最佳静态策略。在初始级别,为每个封闭的最大交流类定义一个受限游戏,并且这些受限游戏是独立解决的。结果表明,从分解过程中获得的解给出了原始随机博弈的最佳平稳策略的意义上,所提出的分解算法是准确的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号